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RECOMBINANT DNA SEQUENCES, VECTOPS CONTAINING THEM 
AND METHOD FOR THE USE THE REOF _ _ 

The present, invention relates to recombinant 
DNA sequences, vectors containing them, and a method 
for the use thereof. In particular, the present 
invention relates to recombinant DHA sequences which 
encode the complete amino acid sequence of a 
glutamine synthetase (GS) (L-glutamate ; ammonia 
ligase [ADP-formingj; EC 6.3.1.2) and to the use of 
such nucleotide sequences. 

The ability of cloned s«snes to function when 
introduced into host cell cultures has proved to be 
invaluable in studies of gene expression. It has 
also provided a means of obtaining large quantities 
of proteins which are otherwise scarce or which are 
completely novel products of gene mar.ipu 1 At i on . It 
is advantageous to obtain such proteins from 
mammalian cells since such proteins are generally 
correctly folded, appropriately modified and 
completely functional, ofter. in marked contrast to 
those proteins as expressed in bacterial cells. 

Where large amounts of product are required, 
it is nececsary to identify cell clones in which the 
vector sequences are retained curing cell ^ 
proliferation. Such stab3e vector maintenance can 
be achieved either by use of a viral replicon or as 
a consequence of integration of the vector into the 

host cell's DNA. 

Where the vector has been integrated into the 
host cell's DNA, the copy number of the vector DNA, 
and concomitantly the amount of product which cculd 
be expressed, can be increased by selecting <or cell 
lines in which the vector sequences neve been 
amplified after integration into the host cell's DNA. 
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A known method for carrying out such a 
selection procedure is to transfer* a host cell with 
a vector comprising, a DNA sequence which encodes an 
ensymc which is inhibited by a known drug. The 
vector may also comprise a DMA sequence which 
encodes a desired protein. Alternatively the host 
cell may be co- transformed with a second vector 
which comprises the DMA sequence which encodes the 
desired protein. 

The transformed or co-transformed host cells 
are then cultured in increasing concentrations cf 
the known drug hereby selecting drug-resistant 
cells. It has been found that one common mechanism 
leading to the appearance of mutant cells which can 
survive in the increased concentrations of the 
otherwise toxic drug is the over-production of the 
enzyme which is inhibited by the drug. This nost 
commonly results from increased levels of its 
particular &RNA P which in turn is frequently caused 
by amplification of vector DNA and hence gene copies. 

Xt has also been found that/ where drug 
resistance is caused by an increase in copy number 
of the vector DNA encoding the inhibitable enzyme, 
there is a concomitant increase in the copy number 
of the vector DNA encoding the desired protein in 
the host cell* s DMA . There is thus an increased 
level of production of the desired protein. 

The most commonly used system for such 
co-amplif icaticn uses as the enzyme which can be 
inhibited dihycrof olate reductase (DKFR) . This can 
be inhibited by the drug methotrexate < MTX ) . To 
achieve co-amplification, a host cell which lacks an 
active gene which encodes DKFR is either transformed 
with a vector which comprises DHA sequences encoding 
DHFR and a desired protein or oo- transformed 
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with a vector comprising a DNA sequence encoding 
DHFR and a vector comprising a DNA sequence encoding 
the desired protein. The trahs formed or 
co- transformed host cells are cultured in media 
containing increasing levels of MTX, and those cell 
lines which survive are selected > 

Other systems for producing co-amplif i cation 
have been employed . However, none of them has been 
as widely used as the DHPR/MTX system. 

The co-amplification systems which are at 
present available suffer from a number of 
disadvantages. For instance, it is generally 
necessary to use a host cell which lacks an active 
gene encoding the enzyme which can be inhibited. 
This tends to limit the number of cell lines which 
can be used with any particular co-amplification 
system. For instance, there is at present only one 
cell line known, which lacks the gene encoding DHPR . 
It would be advantageous if an effective 

CO-amplification system based on a dominant 
selectable marker which was applicable to a wide 
variety of cell lines could be provided. This would 
allow exploitation of -different processing and 
growth characteristics of a vaxieuy of cell lines. 

Attempts to use DHFR genes as dcirinant 
selectable marJcers in other cell lines has not 
proved entirely satisfactory. For instance, a 
HTX-resistant mutant DHFR or a DHFR gene under the 
control of a very strong promoter can act as a 
dominant selectable marker in certain cell types but 
such high concentrations of MTX are required that it 
has not been possible to achieve very high copy 
numbers by selection for gene amp lix* cation . 

Co-transf omants with an additional 
selectable marker also have disadvantages- For 
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instance, this can increase the complexity of 
plasmid construction and requires additional 
time-consuming screening of transformed cello to 
distinguish those clones in which the DHFR gene is 
active. 

A further disadvantage of the known 
co-amplif ication systems is that the DNA sequence 
encoding the inhibi table er, syne is generally not 
under post-translational control. The enzyme in the 
amplified system is therefore produced in large 
quantities, together with the desired protein. This 
could lead to lower levels of production of the 
desired protein. 

Another disadvantage of known 
co-aaplif ication systems is that resistance to the 
known drug car. arise from wee nanisms other than 
amplification. For instance/ in the DKFR/MTX 
system/ it is possible for a mutant DKPR gene to 
arise which produces a mutant DHFR which has a lower 
binding affinity for MTX than does w<ic-ty?e Of.ru, 
If such mutant DHFJR arises, cells conta i r.i r.g the 
gene which encodes it will be more resistant to MTX 
than the original host cell and will therefore be 
selected, even though no amplification has taken 
place. It is possible to select further to 
eliminate lines in which no amplification has taken 
place, but this is a time consuming process. 

Afurthe* disaCvantage of previcus selection 

systems for gene amplif icat inn is that toxic drugs 

are required. In particular MTX is a potential 

carcinogen. 

An additional disadvantage of previcus 

amplification systems is the need f*r repeated, 

time-consuming rounds of amplification, for example 

three or more -, to obtain maximum copy number. 
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Ic is an object of the present invention to 
overcome at least in part the disadvantages of the 
prior art systems for co-amplification. 

According to a first aspect of the present 
invention there is provided * recombinant DNA 
sequence which encodes the complete amino acid 
sequence of a glutamine synthetase (GS). 

Typically, the recombinant DNA sequence 
encodes an eufcaryotic, preferably majnmalian, GS. 
Conveniently, the recombinant DNA sequence encodes a 
rodent, such as mouse, rat or especially hamster, GS. 

Preferably, the recombinant DNA has the 
sequence of the amino acid coding portion of the 
sequence shown in Figure 2, and most preferably 
comprises the whole recombinant DNA sequence shown 

in Figure 2. 

The recombinant DNA sequence of this aspect 
of the invention includes such a sequence from one 
species which hybridises under high stringency 
conditions with another recombinant DNA sequence of 
this aspect of the invention or a part thereof from 
a different species. 

Cl ut amine synthetase (GS) is a universal 
housefceeping enzyme responsible for the synthesis of 
glutamate and ammonia using the hydrolysis of ATP to 
A£F and phosphate to drive the reaction, it is 
involved in the integration of nitrogen metabolism 
.with energy metabolism via the TCA cycle, glutamine 
being the major respiratory fuel for a wide variety, 
possible the majority, of cell types. 

GS is found at a low levels (0,0li-0.1% of 
soluble protein) in most higher vertebrate cells and 
is found at higher levels <>!% of total protein) in 
certain specialised cell types such as hepatocytes. 
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adipocytes and glial cells. 

A variety of regulatory signals affect GS 
levels within cells, for instance glucocorticoid 
steroids and cAMP, and glutamine in a < culture medium 
appears to regulate GS levels post-translationally 
via ADP ribosylation. 

GS from all sources is subject to inhibition 
by avariety of inhibitors/ for example vethionine 
sulphcvifisine (Msx>. This compound appearing to act 
as a transition state analogue of the catalytic 
process. Extensively amplified GS genes have been 
Obtained (Wilson K.H., Heredity, 49,, 181, (1982); 
and Young A. P. and 8iivgo1ri G.M., J. Biol. Chem. , 
2S8, 11260-11266, 1983) in variants of certain 
snararoalian cell lines selected for Msx resistance. 
Recently Sanders and Wilson (Sanders p.g. and Wilson 

The EMBO Journal, 3, 1, 65-71, 1984) have 
described the cloning of an 8.2 kb BglH fragment 
containing DNA coding fcr GS from the genome of an 
Msx resistant Chinese hamster ovary (CHC) cell line 
KG I.MS. Kcwever this fragment does not appear to 
contain a complete GS gene and it was not sequenced. 

Conveniently, the recombinant DNA sequence of 
this aspect cf the invention is cDNA, preferably 
derived by reverse transcription. However, the 
recombinant DKA sequence way alternatively or 
additionally comprise a fragment of genomic DNA. 

It will be appreciated that, in accordance 
with the present invention, a recombinant SNA 
sequence of the first aspect, or a fragment thereof, 
may be used as a hybridization probe for obtaining 
GS ceding sequences from other species. 
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Moreover, the recombinant: DNA sequences of 
the first aspect of the present invention may be 
used in »e<3ical or diagnostic methods, such as for 
detecting disease states in which the level of OS in 

a subject is altered. 

However/ it is envisaged that the main use of 
the recombinant DNA sequences of the first aspect of 
the present invention will h« in co-amplification or 
dominant selectable marfcer systems employed in 
recombinant DNA technology. 

Therefore According to a second aspect of the 
present invention, there is provided a recombinant 
DKA vector comprising a recombinant DNA sequence 
according to the first aspect of the invention. 

Preferably, the vector is an expression 
vector capable, in a transformant host cell, of 
expressing the CS-encoding recombinant DNA sequence. 

The vector may further comprise a recombinant 
DNA sequence which encodes the complete amino acid 
sequence of a desired protein other than GS. In the 
preferred case, the vector will also be capable, in 
the transformant host cell, of expressing the 
desired protein-encoding recombinant DNA sequence. 

Preferably, the GS-encoding recombinant dnA 
sequence is under the control of a regulatable 
promoter, such as a heat shock or a metalloth.ionein 
promoter. 

The present invention also provides a nest 
cell transformed witn a vector according to the 
second aspect of the invention. 

The vectors according to the second aspect of 
the present invention may be used in the 
co-amplification of non-selected genes. Therefore 
according to a third aspect of the present 
invention, there is provided a method for 
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co-amplif ying a recombinant DNA sequence which 
encodes the complete amino acid sequence of a 
desired protein other than CS which comprises s 

either co- trans forming a host cell with a 
vector according to the second aspect of the 
invention which does not contain a sequence encoding 
the desired protein , «*nd a second vector comprising 
said desired protein-encoding recombinant DNA 
sequence; 

or transforming: the host cell with a vector 
according to the second aspect cf the present 
invention which also includes a recombinant DNA 
sequence encoding the desired protein. 

There are a number of advantages to the use 
of the vectors according to the present invention in 
co-amplification of nor.-seicctsi genes. 

An advantage is that the GS gene is 
regulatable, for instance by addition of giutamine 
to the medium. It is therefore possible to amplify 
the GS gene anc the non-selected gene, and 
thendowr.-regulace the GS gene. The host cell will 
then accumulate much smaller quantities of active GS 
while still producing desirably large quantities of 
the required product. This also has the advantage 
of increasing the stability of the cell line, since 
there will be less selection pressure which could 
otherwise lead to instability in maintenance of 
amplified sequences in the cell line if the 
inhiOitcr is removed. 

Cell lir.es are known which lack the GS gene. 
Moreover, there are available schemes whereby such 
cell lines may be selected. These GS deficient cell 
lines may be used in the co-amplification procedures. 
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However, it has been surprisingly and 
unexpectedly shown that: GS expression vectors can 
also be used as effective dominant selectable 
markers in cell lines which contain an active GS 
9 ene by conferring resistance to certain levels of 
Ksx at which the frequencies of resistance caused by 
endogeneous gene amplif icaticn is minimal. It has 
been shown that sucn vectors can be amplified by 
increasing the concentration of Msx in the cell 
lines so that high copy numbers are achieved. These 
copy numbers are higher than achieved using previous 
amplification systems such as DHFR/MTX, and are 
achieved in only two rounds of amplification. The 
possibility cf attaining very high copy numbers is 
advantageous in ensuring that high levels of mRNA 
encoding the desired protein are obtained. 

It is believed, although the Applicants do 
not wish to be limited by this theory, that the 
effectiveness of GS as an amplifiable dominant 
selectable marker is a consequence of the relative 
expression levels of endogeneous- and vector-derived 
<3S genes. Selection for ger* amplification using 
ksx leads almost exclusively to the isolation of 
clones in which the vector-derived GS gene has been 
amplified in preference to the endogeneous 
gene.. when using host cells containing an 
endogeneous active GS gene, it is possible to 
Cacilitate selection by reducing or abolishing 
endogeneous GS activity, for instance by treatment 
of the cell line with dibutyr yl-cAMP and 
theophylline. A cell line which is susceptible to 
such reduction or abolition is the 3T3-L1 cell line. 

The desired protein whose recombinant DNA 
sequence is cc-amplified may oe, for instance, 
tissue plasminogen activator (*PA), although chis 
technique can be used to co-amplify recombinant ONA 
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sequences which encode any other protein, such as 
immunoglobulin polypeptides <iGs>, human growth 
hormone ChC«> ©r tissue inhibitor of 
netalloproteinases (TIMP) 

Preferably, the amplification is achieved by 
selection for resistance to progressively increased 
levels of a GS inhibitor, most preferably 
phosphinothricin or Msx. 

A further advantage of the present 
co-amplification procedure is that Msx is a cheaply 
available product of high solubility, it can 
therefore readily be used at high concentrations to 
enable selection of lines containing highly 

amplified sequences. 

Moreover, the effect of Msx can be 
potentiated by the addition to the selection medium 
of methionine. It is therefore preferred that in 
the present co-amplification procedure, selection is 
carried out in a medium containing methionine at 
higher than usual levels. Similarly, the effect of 
msx can be potentiated by lewer levels than usual of 
glutamate. 

If the G$-encoding recombinant DNA sequence 
in the vector used for co-amplif ieation is under the 
control of a regulatable promoter it is preferable 
for exoression cf the GS sequence to be switched©* 
during selection and amplification and subsequently 

down-regulated. 

in sotne cases, after co-amplification, the 
selected cell line may be dependent to'some extent 
on the GS inhibitor used in the selection 
procedure. If this is the case, the amount of gs 
inhibitor required may be reduced by adding 
glwtamir.e to the culture medium whei*by GS activity 
is post-translationally suppressed* 
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According to a fourth aspect of the present 
invention/ a vector according to the second aspect 
of the invention may be used to provide a dominant 
selectable marker by transforming a host cell with • 
the vector, thereby conferring trans-formant cells 
with resistance to GS inhibitors. 

The host cells which are use<5 in the fourth 
aspect of the present invention may contain an 
active 6£ gene. For the reasons set out above, it 
has been found that selection can still be achieved 
even where an active endogeneous gene is present. 
The advantages of using the vector of the present 
invention in co-amplif ication procedures are also 
shown in the use of the vectors as dominant 
selectable markers. 

It is preferred that the host cells used for 
the co-amplification procedures or selection for 
dominant marker procedures of the present invention 
are mammalian, most preferably namster, cells, and 
Chinese hamster ovary (CKO)-KI cells or derivatives 
thereof are particularly suitable. 

According to a fifth aspect of the present 
invention, the vectors according to the second 
aspect of the invention can be used in endowing a 
cell line with the ability to survive in a medium 
lacking giutarrine by transforming a host' cell 
completely lacking or reduced in CS activity an 
active GS gene with the vector. It is envisaged 
that this procedure will be particularly, but not 
exclusively, applicable to hybridoma and myeloma 
cell* and derivatives thereof. 

It has been found that the density to which 
certain cells, in particular myeloma *ells, can gro* 
in a medium may be limited by the requirement for 
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glutamine or by-products of glutaaine metabolism. 

If the cells could be made glut amine- independent 

either directly or as a result of additional medium 

* 

alterations, it is believed that greater cell 
densities in culture could be achieved, thereby 
increasing the amount of protein produced per 
culture volume by the cell line. 

It is* therefore believed that the use of 
recombinant DNA sequences encoding GS, for instance 
in vectors for co-amplif i cat ion » selection or 
transformation to glutaoine independence, will lead 
to highly flexible and advantageous syatecs which 
will be surprisingly superior to Other similar 
systems, for instance based on DHPR/MTX . 

The present invention is now described, by 
way of example only, with reterence to the 
accompanying drawings, in which: 

Figure 1 shows restriction maps of the GS 
specific cDNA inserts in pGSC4S, ^sgs 1.1 and Xgs , 
5.21 clones, in which it can be seen from the arrows 
that the nucleotide sequence of the coding region of 
GS was predominantly obtained from Ml 3 subclones of 
Xgs 1.1 and various regions confirmed using 
subclones of )sgs S.21 and pGSC45; 

Figure 2 shows the cdna (a:) and predicted 
amino acid (b?> sequences for the Chinese hamster GS 
gene, together with the published peptide sequences 
(c:) and peptide designations (d:) of bovine brain 
GS. The sequence (e; ) indicates the polyadenylation 
site used in Ag s 1-1- Amino acidresiduas are 
indicated as their single letter codes; 
non-homologous bovine residues are indicated in 
lower case letters. The ' /\* below base 7 
represents the start of the pGSC45 insert and the 
» » marker represents the priming sequence in 
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X gs 1.1 complementary to residues 1135-1X32. The 
•> • and 1 < ' symbols represent bases involved in 
stems of the calculated structure for the 5* 
untranslated region; 

Figure 3 shows the structure of three GS 
expression plasmids in which a) shows plasmid 
PSVLGS.1 (8.5 kb) containing a 4.75 kb GS mini gene 
under the control of the late region promoter of 
SV40 (L) cloned in the bacterial vector pCT54. The 
GS sequences include the compiere coding sequence, a 
single intron and approximately 2kb of 3' -flanking 
DNA spanning both of the presumed sites of 
polyadenylation, (b) shows plasmid pSV2.GS (5.5kb) 
containing 1.2 kb of GS cONA under the control of 
the early region promoter of SV40 (E) , the intron 
from the 7-antigen gene of SV40 and a sequence 
containing che early region poiyadenyla tion signal 
of SV40, and (c) shows plasmid p2!PCS (12,25 kb) 
containing the Hir.dlll-SamBI fragment from pSV2.GS 
(containing the GS coding sequence and SV40 intron 
ana polyadenyiation signal) cloned in the retroviral 
vector pZPIP Keo SV(X> in which latched blocks 
indicate irrelevant mouse DNA sequences, 5* and 3' 
LTRs as the long terminal repeats of Moloney Murine 
Leukaemia Virus (mmlv) , the filled block represents 
an SV4 0 fragment spanning the origin of replication 
oriented such that the SV4C early region promoter 
directs the expression of the gene from transpeson 
Tns which confers resistance to G413 in mammalian 
c*Us (,eo) and unmarked blocks contain additional 
DNA sequences from MMLV ? 

Figure 4 shows Southern blots of cell lines 
transfected with pSVLGS.l (Panel A) or pSV2 GS 
<**nei B). The fciot is probed with an RNA probe 
specific for SV40 cri S in : r^ion Dna. Panel A 
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representa a 2 hour exposure. Each lane contains 
2.5 ys genomic DNA from the following cell lines. 
Lanes 1 to 3 contain DNA from initial transf ectants; 
lane 1/ SVLGS2; lane 2, svlgs5; lane 3, Svlgss'. 
Lanes 4 t*3 6 contain D**A from- cell lines obtained 
after a single round of selection for gene 
amplification with Msx: lane 4 SVLGS2(500 jiMR) ? lane 
5, SVLGS5(2S0 jiMR); lane 6, SVLGS9 ( 500 jitoR ) . Lane 7 
contains DNA froxr. a coll line subjected to 2 rounds 
of selection for gene amplification, SVLGS5 ( 2mMR) . 
Panel B is an exposure of approximately two weeks. 
Each of lanes 1 to 6 contain 5 jz<i of genomic DNA and 
lane 7 contains 2*5 ^g . Lsr.es 1 to 3 contain DNA 
from initial transf ectar.t cell lines s lane 1, SV2 
GS20; lane 2, SV2.GS25: lane 3, SV2 GS30. Lanes 4 to 
6 represent cell lines after one round of selection 
in higher concentrations or msx: lane 4, 
SV2.CS20C100 pHR); lanes SV2 .CS25 (500 yMR > j lane 6, 
SV2,GS3C(500 ^iM*>. Lane 7 represents a cell line 
obtained after two rounds of selection in msx, 
SV2.GS3C(l0mKK) . 

Figure S shows a primer extension analysis of 
RKA derived from cell lines transfected with 
pSVLGS.l. A DNA oligonucleotide which binds to RNA 
at the presumed translation " start" was used to 
synthesise DNA from rotal una preparations. RKA 
preparations shown are? lane 1,. SVLGS2? lane 2 
SVLGS5; lane 3 a derivative of CK0-K1 resistant to 
30 jxH Msx (to indicate the extension from wild-type 
GS m£NA>; KW, pAT153 digested with Kpall molecular 
weight markers. 
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In the nucleotide and amino acid sequences 
shown in the accompanying drawings and in the 
description, the following abbreviations are used as 
appropriate • 0= uridine; G= guanosiae? T« thymidine; 
A» adenosine: C= cytosine; **** a termination codon; 
- denotes "an unknown nucleotide residue; A= alanine; 
C= cysteine; D= aspartic acid; glutamic acid; 
F» phenylalanine; G= glycine; K— histidine; 
i= isoleucine; f« lysine; L* leucine; M= methionine; 
N= asparagine; P" proline; 0= glutamine; 
R« arginine; S« serine; T- threonine; V- valine ; 
W* tryptophan; Y= tyrosine; X* an unknown amino 
acid; P3S=* phosphate buffered saline; S0S= sodium 
dodecyl sulphate; and EDTA= ethylene diamine 
tetraacetic acid. 
Example 

Csing a Multi-step selection procedure in a 
glutamine-free medium, a mutant line was derived 
from the Chinese hamster ovary (CHO) KG1 cell line 
{itself a derivative from the CH0-K1 line obtained 
as CCL €1 front the American Type Culture Collection, 
Rockville, HD , USA). The mutant cell line, labelled 
CHO-KGiMS, is resistant to 5roi: Msx. (The 
parentalceli line KC1 is only resistant to 3jiM Msx). 

A subclone, KC1MSC4-M, of the mutant cell 
line was used as a source of cellular DNA. Cells 
from the subclone were washed in PBS after 
trypsinization and pelleted at 2000 r.p.m. for 4. 
min. The pellet was resuspended in 100 mM Tris-HCL, 
pH 7.5, 10 IRK EDTA and lysed by the addition of SDS 
to 2%. RKase A was added to 50 ^ug/ml and the 
solution incubated at J7°C for 30 min. Protease K 
was added to 50 jig/ml and incubation continued at 
37 W C for from 30 min to 1 hr . The solution was 
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phencl extracted twice followed by two 
chloroform: isoarayl alcohol (24 ;1) extractions. The 
DNA was precipitated with isopropanol and then 
resuspended in 2 mM EOTA , 20 mM Tris^HCl, pK 7 . 5 and 
stored at 4°C. 

Genomic DNAs from parental FGl, mutants KGlHS 
and KG1MSC4-M, and reverter. t KG1MSC4-0 cells were 
digested with a variety of restriction 
endonucleases, subjected to agarose gel 
electrophoresis and Southern blotted onto 
nitrocellulose filters. These blots were probed 
with ©ligotdT)- primed cDNA made from parental KG1 
and mutant KG1MSC-4M poly (A) mSNAs. When wild-type 
KGl cDNA was used as a probe, a series of identical 
bands was seen across tracks from all cell lines. 
VJhen KG1WSC4-M mutant CDNA was used as a probe, the 
same common bands were seen across all tracks 
together with unique bands specific to mutant KGlMS 
and KG1MSC4-M genomic DMA. The bands common to all 
genomic DNAS were shown to be due to mitochondrial 
<mt) DNA, as determined by rsftt-.ri ction enzyme 
analysis of mtDNA purified from KG1 cells. The 
smallest DNA fragment identified which could contain 
the whole e»f the presumptive coding sequence for GS 
is an 8.2— kb Bglll fragment. On double digestions 
with PstI and Bglll, the two PstI fragments (2.1 kb 
and 2.4 kb) are seen to remain intact, indicating 
that both PstI fragments are contained within the 
sgi i i fragment. 

30 jig of XG1MSC4-M DNA was digested to 
completion with Bglll and the fragments separated by 
electrophoresis on an 0.3% agarose gel. The 
amplified 8.2 kb band was identified using ethitfium 
bromide staining and long wave ultra violet 
radiation by comparison with indlll and mtPstl 
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digests. The DNA band was eluted into a well cut 
into the gel and purified by phenol extraction, 
chloroform extraction and ethanol precipitated using 
carrier tRNA . Purified DNA was ligated with 
BamHi- digested, bacterial alkaline 
phosphatase-treated pUC9 (Vieira, J. and Messing, 
J., Gene/ 19/ 259-268, 1982). Recombinant DNA was 
used to transform E* Coli to ampicillin resistance 
and white colones on Xgal picked for analysis. 

ISO recombinant clones were obtained and DNA 
analysis of 11 of these showed that they all had DNA 
inserts of about 8.0 Kb. Differential colony 
hybridization and DNA spot hybridizations identified 
two recombinant clones which gave strong 
hybridization with a mutant KG1MSC4-M cDNA probe but 
no signal with a parental KGi cdna probe. Both 
recombinants pGSl and pCS2 produced the Psfcl 
restriction pattern expected from insertion of the 
retired Bglll restriction fragment. pGSl DNA was 
used to hybrid select GS m£NA from total cytoplasmic 
and poly (A) KG1MSC4-M RNA. The selected mRNA was 
translated together with KG1 and KG1MSC4-M total 
cytoplasmic HNA and C 35 Sl methionine-labelled 
polypeptides separated by SDS-PAG5. The major 
translation product ot pGSl selected roRNA is 
apolypeptidc of 42 000 kD MW which co-migrates with 
an amplified polypeptide in KG1MSC4-M translations. 
pGSl therefore contains genomic CHO DNA which 
contains at least part of the GS gene. 

This part of the work was carried out as 
described by Sanders and Wilson (loc. ext.). 

A 3.5 kb Hindlll fragment containing the 3* 
end of the GS gene from KG1MSC4-M was subcloned from 
pGSl into pUC9 to form plasmid pG5l!3 . 
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A cior.e bank was prepared by cloning a Sau3 A 
partial digest of KG1MSC4-M into the Ban.Br site of 

Recombinants were selected for hybridisation 
to pGSl. a BamHl-BcoRl fragment from a selected 
Ai-47 recombinant was subcloned into pUC9 to form 
plasmid pCS2 3 3 S (Kayward ct al., Nuc . Acid Res., l 4 
999-1008, 1986). ' 

CDNA libraries were made from KG1MSC4-M mRKA 
in P BR322 and *otl0 using standard procedures. The 
mRNA was converted to cDNa using ollgo-dT primed 
reverse transcriptase, and dsDNi. made by the RKase a 
procedure (Gubler, o. and Hof fmann, . V. , Gene, 25 
263-269, 1983). The dsDNA was either tailed wi-h C 
residue (Micnelson, a.m. and Orkin, S.H., J. bIoI 

Chem., 257, 14773-14782, 1982), annealed „ «„ tall ^ 
PBR322 and transformed into E. coli DHL or 
methylated end ligated to EcoRi liters. ti nkered 
DNA^was digested with EcoRi and linkers reeved bv 
Sephade* C7S chromatography in TOES (0.14 „ NaC l " 
0.01 « TriS, p H 7.6, 0.001 M EDTA. 0.1% SDS) 
tinkered dna in the excluded volume was recovered by 
etnanol precipitation and annealed to EcoRi-cu> 
AgtlO dma. Following in vitro packaging, 
recombinant phage was plat** on the high frequency 
lysogeny str ain E . coli Hfl (Huyhn, x.v. , y OU no R a 
end Davis, R . W ., in "DNA cloning techniques u- A 
practical approach < Ec . Glover, D.M.), X .*.i. Press 
Oxford, 1985). 

About 5000 colonies and 200.- plaques wer* screak 
on nitrocellulose filters using nick-translated 
probes derived from P vc subclones of GS genomic 
sequence. A l>b E coRI- 8g i ;i fragment from P GS2335 
was used as a 5' p rob e, and the entire 3.3 k B 
Hindin fragment of pGSin was used as a 3 • probe 
Plasmids from positive colonies were analysed by 
restriction digestion of small-scale 
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preparations of DNA and the longest clone (pGSC4S) 
selected for further analysis. 

Positive ^clones were plaque purified, 
grown up in 5000 ml of S.coli C600 liquid culture, 
and the phage purified on CsCl step gradients, dwa 
was prepared by formamide extraction (Davis, R.w., 
Bostein, D. and Roth, S.R., Advanced Bacterial 
Genetics, cold spring Harbor, 1980). Clones with 
the longest inserts were identified by SceSl 
digestion and inserts subcloned into pAT153 and 
M13mpll phage for further analysis and sequencing. 

The colonies or plaques were screened first 
with a probe derived f row the s ' «nd of the GS 
gene. Positive colonies or plaques from this 
analysis were picked and rescreened with a longer 
probe covering post of the 3' end of the gene, m 
this way it was anticipated that clones with long or 
possibly full length inserts would be selected and 
the tedious rescreening for 5» ends would be 
avoided. Several pl&smid clones and \$tiQ 
recombinants were derived by this means. Further 
analysis of or.e of the placid clones (pCSC<5> by 
restriction enzyme digestion and partial sequencing 
revealed that it had an insert of about 2.8ku and a 
polyA sequence at the 3' end. Northern blots 
indicate that a major mRKA for as is about this size 
(Sanders and Wilson, (loc. cit.)>, so the insert in 
PGSC45 was potentially a full length copy of this 
*KNA. The two \clones ( \«„ 1,1 and \gs S.21) had 
inserts of 1450 bp and li?o bp respectively. 
Restriction maps and *ligrw*nt of the cONA inse*** 
in pGSC4S, >gs 1.1 and 5,21 are shown in 

Figure 1. it is clear that the inserts in the 

clones are considerably shorter at the 3' end 
than the plasaid clone and m*y represent cDH* copies 
of one of the minor mKXAs. The insert in Xgs.l.l 
extends some 200 base pairs at the 5' end. 
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Th* nucleotide sequence of the mP.NA coding 
for glutamine synthetase was obtained from M13 
subclones of pGSC45 and EcoRI subclones of )sgs 1.1 
and )vgs 5.21 and IS shown in Figure 2. Some 
confirmatory sequence was also obtained from the 
genomic clone pGSl . Primer extension of GS mRNA 
with an oligonucleotide complementary to nucleotides 
147-166 gave a major extension product of 166 
nucleotides. This showo that pcso45 only lacks six 
or sevan nucleotides from the 5' end Of the mRNA. 
Nucleotide sequencing of the primer extended product 
by Maxam-Gilbert sequencing confirmed this although 
the first two bases could not be determined. 

Sequences at the 5* end of ^gs 1.1* which is 
some 200 bases longer at the 5 1 end than pGSC4S, 
showed considerable inverted homology to sequences 
at the 3' end of this clone (which was about ISO 
bases shorter at the 3* end than ^.gs 5.21, (see 
Fig. 1). These additional sequences are probably 
cloning artefacts, arising during second strand 
synthesis due to nucleotides 6 to 1 priming DNA 
synthesis via their complementarity to nueelotides 
1132-1137 despite the fact that the RNase H 
procedure was used. It cannot be excluded that the 
duplication arises from transcription of a modi tied 
GS gene, producing a modi tied mRNA which has been 
subsequently cloned, although the primer extension 
results did not suggest that there was any major 
mSNA species with a 5' end longer than 166 
nucleotides , 

The predicted amino acid sequence for CKO 
glutaxnine synthetase is shewn *in Figure 2. The NH 2 
terminus was identified by homology with the NH2 
terminal peptide found in bovine brain glutamine 
synthetase (Johnson, R.J. and Fiskiewicz, D., 
Bioehom, Biophyft . Acta, 827, 439-446, 1985). The 
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initiating AUG follows a precise CCACC upstream 
consensus sequence found for true initiation codons 
and is followed by a purine (i.e. CCACCATGG } . 
{Another AUG codon at position 14 is not in a 
favourable eontext by the same criteria and is 
followed by a termination codon in frame 21 
nucleotides downstream.) The predicted amino acid 
composition of the GS protein gives a molecular 
weight of 41,964 (not allowing for K-terminal 
aeetylation or other post-translational 
modifications ) , in agreement with other estimates. 
The basic nature of the protein is reflected in the 
excess of arginine, histidine and lysine residues 
over those of aspartate and glutamate. 

The predicted amine acid sequence shows 
excellent homology with bovine and other GS derived 
peptide sequences obtained by peptide sequencing, 
indicative of an accurate OKA sequence. <The amino 
acid sequence allows the ordering of all the 
cyanogen bromide peptides and most of the tryptic 
peptides published for bovine GS) . 

Tfte cao sequence also shows seme homology 
with the GS sequence from the cyanobacterium 
Anabaena, notably at residues 317-32S, (NRSASIRIP) 
which are an exact match to Anabaena residues 
342-350 . In addition/ related sequences can be 
found in glutamine synthetases isolated from plants. 

Access to complete cONA clones and genomic 
clones for Chinese hamster GS has not only allowed 
the amino acid sequence of glutamine synthetase to 
be predicted, but also allows a detailed analysis of 
the position of the introns within the gene and 
their relationship to the exons coding for the 
structural domains of the protein. 

A GS minigene was constructed from a cONA 
sequence (spanning the majority of the protein 
coding region) and a genomic sequence (which 
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recreates the 3 r end of the coding sequence). The 
3.4 kb ECoRI-Sstl fragment of pGSl encodes a single 
intron, all of the 3* untranslated region of both 
zsKNA species identified and contains about 2 Xb of 3* 
flanking DNA. This" DMA fragment was cloned between 
the EcoftI and Bam-ii sites of pCTS4 (Esatagc at al,, 
FNAS-U5A, $C, 3671-3675, 1983) to create pCTGS. The 
0-8 Kb EccRZ fragment of 1.1 was then inserted 

at the BccRl site of pCTGS in the correct 
crientation to recreate the 5 ' end of the gene. The 
late promoter of SV40 was cloned upstream by 
inserting the 342 bp PvuII - HindHI fragment of 
SV40, containing the origin of replication, at the 
Kindlll site of the above plasmid in the appropriate 
orientation to produce plasir.id pSVLGS-1 which is 
shown in Figure 3(a) 

An alternative GS expression construct was 
made by placing cDNA containing all of the GS coding 
sequences between sequences from sv<0 which direct 
efficient expression in tnaurcnalian cells. The 1.2 kb 
Mael-pvuII fragment of ^\gs 1.2 was cloned in place 
of dhfr sequences in pSV2.chfr, (Subramani, 
S.pMulliganr R . and Berg, P., Mol- Ceil. Biol M 1* 
854-864, 2951) between the nindllS and 3glII sit*ft 
to form pSV5.CS which is shown in Figure 3(b). 

In order to place GS coding sequences under 
the control of the Moloney murine leukaemia virus 

LTR promoter, the Hindi 1 - BactfSI fragment 
from pSV2.CS < roe Figure 3b. v introduced at the 

3amKX site of p2IP-NeopSV (X) (Cepko, C.L., Roberts, 

B. £- and Mulligan, R.C., Cell, 37, 1053-1062, 19M). 

The 3.0 kb Kindlll - BamKI fragment of pt?A 
3,15 (Stephens, P.E., Bcndig, M.M. and ffentschei, 

C. C. * manuscript in preparation) contains a cDNA 
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coding for tissue plasminogen activator, downstreo* 
of which is the SV40 small t-intron and the ^ 
polyadenyiation signal fro* the early region 
transcript of SV40. This fragment was cloned in a 
3-way ligation with the 342 bp SV40 PviilX - ■!»«« 
figment into the BamHI site of pSVLGS.l so that the 
tPA gene was under the control of the SV4 0 early 
promoter. This generated two plasmids, pSVLOStFAl-6, 
in which the OS and tPA transcription units are in 
tandem and P 5VLGStPAl7, in which the two genes are 
in opposite orientations. 

CHO-K1 cells, obtained from ATCC, were grown 
in Glasgow modified Eagle's medium (GMEM) without 
glutamine and supplemented with 10% dialysed foetal 
calf serum (GIBCO), 1 mM sodium pyruvate, 
non-essential amino acids (alanine/ aspartate, 
glycine and serine at 100 f*, asparagine, glutamate 
and proline at 500 jiM) and nucleosides (adenosine, 
guanosine, cytidine and uridine at 30^ «nd 
thymidine at 10 jiM). For selection, L-methionine 
sulphoximine (m« from Sigma) was added at 
appropriate concentrations* 

Approximately 3*10* cells per 100 mm. petri dish were 
transfected with 10 circular plasmid dka 
according to the calcium phosphate co-precipitation 
procedure (Graham., P.L. and van der Eb, A.J., 
Virology, 52, 4SS-467, 1983). Cells were subjected 
to a glycerol Shock (15% glycerol in serum-free 
culture medium for 2 minute*/ 4 hours after 
trar.sfection (Frost, 8. and Williams, J., Virology, 
91, 39-50, 1978). One day later, transfected cells 
were fed with fresh selective medium and colonies of 
surviving cells were visible within 2-3 weeks. 
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tPA activity in cell culture eupernatants vas 
measured using a f iferin-acarose plate assay using a 
tPA standard (Biopool) for comparison, Attached 
eells were typically washed in serxaa-free medium and 
incubated for 18-20 hours in serum-free medium at 
37*C. After removal of medium samples for assay, 
the cells were trypsinised and viable cells 
counted. Results were then expressed as units of 
tPA/10 6 cells/24 hours. Colonies cf cells in petri 
dishes were assayed for tPA production by overlaying 
directly with a fibrin agarose gel. 

In the glutamine-f ree medium used in these 
experiments, the specific GS inhibitor, Msx is toxic 
tc CS0-K1 cells at concentrations above 3 jxM. To 
test whether the CS expression plasmids could 
synthesise functional GS in vivo, each plasmid was 
introduced into CH0-K1 ceils by calcium phosphate - 
mediated transfecton and tested for the ability to 
confer resistance to higher concentrations of Msx. 

Resistance to Msx can, however, also arise by 
amplification of the endogenous GS genes (or perhaps 
by other unknown mechanisms). Therefore, in order 
for a GS expression vector to be useful as a 
dominant selectable marker, it must confer 
resistance to a particular concentration of Msx with 
a greater frequency than the frequency of 
spontaneous resistant mutants. The frequency with 
which spontaneously resistant clones are detected 
defends on the concentration of Msx used for 
selection. Thus, for instance gene amplification in 
CHO-K1 cells leads to approximately 1 surviving 
COlony/10 4 cells plated in 10 ^uM Msx, but this 
frequency felines to less than 1/10 7 if cells are 
selected for resistance tc 25 jiii Msx. 
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Since the frequency of transfection of CHO 
cells using the calcium phosphate co-precipitate 
technique is generally reported to be less than • 
1/103. a range of Msx concentrations was chosen for 
selection in excess of 10 f*. The results in Table 
1 show that transfection with any of the three CS 
expression plasnids leads to survival of a greater 
nuabcr «,£ Msx-resistant colonies than the background 
frequency detected in »ock-transfected cells when 
selected at IS UK or 20 uM Msx. 

pZIPGS yields only a slight increase in the 
number of surviving colonies above -background. This 
vector would therefore be a poor selectable taarker 
and was not studied further. pSV2.CS and psvws.l, 
however, both appear to act as effective dominant 
selectable markers in this cell line. The frequency 
with which resistant colonies arise after 
transfection with either plasmid in these 
experiments is at least 25 tis.es the frequency due 
to endogenous amplification if selection is carrxed 
out at 15-20 um Msx. Apparent transfection 
frequencies for pSV2.GS of up to 3.8/10* cells and 
for pSVLGS.l of up to 2.S/10S cells were 
observed. The differences in apparent transfection 
frequencies between the three plasirids are likely to 
reflect differences in the efficiency with which the 
GS gene is expressed in the above three vectors. 

An independent estimate of transfection 
efficiency car. be obtained i<. the case cf pZIPGS 
since the vector also contains a neo gene which 
confers, resistance to the antibiotic G418. 
Selection with G418 instead of Msx yielded a 
transfection frequency substantially higher than 
obtained by selection in 14-20 Msx (see Table 1), 
indicating'that the vector is being taken up by the 
cells and reinforcing the view that the CS gene is 
relatively poorly expressed »n tins vector. 




TABLE 1 



Apparent transaction frequencies of 
constructs in CHO-K1 cells were determined by the 
numfcer of surviving colonies/10* transfected cells 
at various concentrations of Msx, or by resistance 
to 0.8 mg/al 0418 (Results are from 3 transf ections, 
(i), (ii> and (iii>). 



Vector 

pSVlsGS .1 U ) 
(iii) 



(ii) 

Mock (i) 
(ii) 
(iii) 



15 uM 20 2Sfi^ 

13.6 9-2 

24.5 10-0 



pSV2.GS(i) 26.4 
(ii) 
(iii) 



18.0 12.0 
32.0 7,4 
38.0 29.0 



pZlPGS (i) 0.72 0.5 0 



1.1 0 

0.47 0.24 0 

0.29 0 

-1.0 o 
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TABLSlCOAtd 



30 



2.4 



100 ^J* 
0 . 24 



1 mM g_418 



12.0 



1.4 



30 
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that the generation ©£ 

three P* e ^ M conta in vector DNA. 

Msx-resistant cells *«° la produced in these 

Secondly, novel GS J ^ 

will direct the fonnat^on of « *» GS ^ 

^ e.K« end from the natural 

follows. established from 

Three cell Une* «ere trans£ection with 

ln aivid U .l colonies a£t f ! r oro =olenies 

pSVWS.l and three cell lxnes f.c 

transected with *v..« cell U". ^ § ^ „ 

^ Msx - C ' U J * 25 and 30 U» Hsx respectively . 
are resistant to 20, o£ th . M «.u 

DNA was prepared from each ox 

v mo- o£ the DNA sastples vas 

indicates that all of .he msx 

contain vector DNA. Th e estiB(a ted by 

vector pr«**«* in •***> «•« ' an ° Bt _ n<3ard 
caparison with Known amounts of a 

preparation - .^^^«T W . cell 
Fro* this, it is clear that ell ^ 
lines contain multiple cop.es of the ve 
about S00 copies per cell (see Table ZU 
SV2.6S cell lines also contain vector DNA 
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bu* la .11 *hree «ses there seems to have been 

««Vj . ninele copy of vector DNA per 
integration of oniy a singie cw^r 

It is to be noted that the result obtained 
with psvws.l is highly unexpected. Up until the 
present there has been no reported case in wtue.. 
such a high copy number has been produced nerely by 
transaction. It is believed that this high copy 
number is due to «h. presence in the vector of a DhA 
sequence which favours the incorporation of h ^ 
numbers of copies of vector DNA into the host cell s 

Such high copy numbers of integrated vectors 
hove not been observed with pSV2.GS. It is 
therefore believed that DNA sequences partly 
responsible for the high copy number transfection 
are found either in the intron or in the 3' region 
cf the oenomic GS DMA part of the pSVWS.l vector or 
adjoining vector sequences. However, the copy 
number probably also reflects the expression level 
required to attain resistance to the particular 
level of Max used for selection.. 

Clearly, this high copy number transfection 
sequence will be of use not only with GS encoding 
sequences but also with other protein sequences, 
such as those encoding selectable markers or 
awplifiable ger.es because it provides a means of 
increasing copy number and hence expression levels 
of desired genes additional to the efiects of 
selection for further gene amp? ! . eieation. 

Therefore according to a further aspect of 
the invention there is provided the recombinant DNA 
sequence present in the pSVLGS.l vector which is 
responsible for achieving high copy number 
transfection of vector OVA into a host cell or any 
other recombinant DNA sequence which will provide 
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the same function ♦ 

The S- ends of GS mJRNA produced by 
*«-re S i stent cell lines were analysed by primer 

"length was synth.eised ^ioh hybrids* » 
LlL of the nORNA ne»r the start of the protein 

"gion. -verse transcribe should extend 
this Primer to a length of 146 bps from wild type GS 
*»d to a length o f a pprcxi»ately .00 bps t, the 
start Of transcription in the case of pSVLGS.l 
Zl The RSA predicted fro* P SV 2 .GS is shorter 
^an the natural mRNA and so could be masxea by 
'orop-offs- in the primer extension reaction and was 

not analysed. . . _ r<: 

The results shov n in Figure S show thet a GS 
specific mRSA longer than wild-type mRHA is indeed 
Produced in SV W S cell lines, stron 8 ly supporting 
the conclusion that the transacted gene is 
transcribed in these cells. The reverse 
transcriptase dees not extend the primer to the 
predicted length, but seems to drop off at at leas 
"major sites- probably due to inhibition of reverse 
transcription by secondary Structure in the 5 
untranslated region of this RMA • 

Three Ksx-resistant cell lines transfecte* 
with pSVLGS.l and three cell lines transected with 
PSV2.GS were grown in v«i=u S concentrations of MM 
in crdor to select for GS gene amplification. For 
each cell line, approximately XO* cells w.re plate, 
in 100 j*, 2*0 p. 500 jM an. 1 ™ ««• » 
dayS , the maxunu* concentrations of «sx at which 
surviving colonies, could be observed 
line vere as follows, SVLGS2, 500 J»M: SVLGSS. 2S0 
uM } SVLGS9 , 500 ^ S SV2.GS20, 100 J*L t SV2.GS2S, 500 
Jm, end SV2.GS30, 500 ji*. The *ost highly resistant 
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colonie* obtained £r« each cell line were pooled 
and two of these Ms*-resistant pools -ere >^«tri 
to a second round of amplification, svlgsz (500 
UMB) and SV2.GS30 (500 were plated out at 1 tff. 

5*K, 10 »M and 20 «M Ms*. After 1S-20 
colonies appeared on plate, containing S^GS2(500 
ukr) at up to 2 IK Msx and in the case of SV2.GS 
(500 uMR> at up to 10 mM Msx. From these, two 
highly resistant cell lines SVGS2 (2 mWO and 
SV2.CS30 (10 OMR) were established. Bach of these 
highly resistant cell lines contain ceils which have 
arisen from multiple independent amplification 

events. , , . 

A Southern blot of DNA prepared from all O- 

the Msx-resi stent cell lines was hybrised with a 
probe specific for SV40 ORl-region DNA. The results 
of this are shown in Figure 3. From a comparison 
with standard preparations of plasmid DMA, the copy 
numbers could be determined and these are shown in 
Table 2. 

After the first round of selection, all three 
SVLGS cell lines show approximately a 10-fold 
increase in copy number of the vector DNA. 
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TABT.E 2 

Copy Nuxnber of Transfected Genes Subjected to 
Selection for gene, amplification 



Cell Line Cone, of Copy tfuir.ber 

Msx ()aM> 

SVLGS2 20 1 7 0 

SVLGS5 20 2S 

SVLGS9 30 SOO 

SVLCS2(500 pWR) 500 1200 

SVLCS5(250 ^uKR ) 250 300 

SVLCS9(S00 JJMR) 500 4200 

SVLGS2(2 inMR) 2000 15000 

SV2.GS20 20 1 

SV2.GS25 25 1 

.SV2.CS30 30 1 

3V2_CS20U00 jaMR) 100 1 

SV2.GS2S(500 /iMR) 500 1 

SV2.GS30CS00 ^oMJO 500 1 

SV2.GS30U0 mMJR) 1000 S>-iQ 
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In the second round of selection, SVLGS 2 
shows at least a further 10 fold amplification 
attaining approximately 15,000 copies/cell. 

In marked contrast, the single copy of 
PSV2.GS present in initial transfectants is not 
significantly increased after a single round of 
selection and SV2.GS30(10mMK) resistant to 10 mM msx 
contains only 5-10 copies of the vector in each cell. 

in order to determine whether there has also 
been amplification of the endogenous GS genes, the 
probe was removed and the blot re-probed with a 
nick-translated Bgll-Bglll DNA fragment obtained 
f row tfto third intron of the GS genomic sequences. 
This probe is therefore specific for endogeneous GS 
genes and does not hybridise with the transfected 
genes which lack this intron. No significant 
endogenous gene amplification could be detected by 
this means in SVLGS cell lines. A small degree of 
endogenous amplification could be seen in 
SV2-GS30(10mMfc) cell DKA* 

Thus pSV2.GS, while acting as an effective 
dominant selectable marker in CKO-Kl cells, appears 
to express GS too efficiently to be suitable as an 
amplifiable marker, since very high levels of Msx 
are required in order to select for even slightly 
increased copy number. pSVXCS . 1 on the other hand 
can toe used as a dominant selectable marker and can 
also be amplified to very high copy numbers. 

The suitability of pSVLGS.l as a selectable 
and amplifiable vector was tested by introducing 
into it a transcription unit capable of expressing 
tissue-plasminogen activator (tPA). Two plasmids 
were examined in which tPA cDNA under the control of 
the SV40 early region promoter and polyadenylation 
signal was cloneo at tne unique Bamhl Site of 
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p svx*s.l- m P svLGS.tPA16, the GS and tPA genes are 
in the mm orientation and in pSVLGS.tPAi?, the two 
genes are in opposite orientations.. 

Both constructions were introduced into 
CHO-KI cells and transferred cells were selected for 
resistance to 15 uM Ms*. After 10 days, the 
surviving colonies were screened for tPA activity by 
fibrin overlays. Many of the surviving colonies 
secreted tPA, thus confirming that the GS gene could 
act as a selectable marker to identify transfected 
clones. The tPA-induced clearings in the fibrin gel 
were larger and more numerous on plates transacted 
with pSVGS.tPA 16, indicating that the tPA gen* was 
pore efficiently expressed when in the same 
orientation in the vector as the GS gene than when 
the two genes were in opposite orientations. 10 
colonies from a transfection with pSvLGS . tPAl€ , 
which produced large tPA darings, were grown in 
96-well plates. Of these, the two cell lines 
secreting the highest levels of tPA, 16-1.20 jxM* and 
16-2.20 J** were selected for further study. Each 
was subjected to selection in increased 
concentrations of Msx and the t?A . production from, 
pools of colonies obtained at different stages is 
shown in Table 3. 
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TABLE 3 

Cell line tPA secreted (U/1Q & cells/24 hours) 

16-1.20 jxM* 260 
16-1.200 jaMK 2700 

16-2.20 fMR 400 
16-2.200 jiMR 2750 
16-2.10 mWR 4000 

16-2.10 31MR, the cell line producing the highest 
levels cf tPA, was cloned by limiting dilution and a 
clone was isolated which secreted 4000 U/10 6 
cells/day. This level is comparable with the 
highest level of tPA expression reported using dhfr 
co-Amplification, 
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It has thus been shown that, when a GS cdna 
cloned in the retrovirus based vector p2I?-Neo SV(X) 
was used, the frequency with which Msx-resistant 
colonies arose was low, probably due to relatively 
inefficient expression from this vector in this cell 
line. On the other hand, two different constructs 
in which the GS gene was under the control of SV40 
promoters gave rise to cells resistant to 
substantially higher levels of Msx than wild-type 
cells. All of the resistant colonies tested 
contained vector DMA, and nc^vel mAKAs consistent 
with transcription of the transfected genes could be 
detected in cell lines containing pSVLGS.1 DNA. 
Msx-resistant colonies could be identified using 
both OS expression plasmids using SV4 0 promoter* at 
a frAquency greater than 1/10 5 cells , indicating 
that both constructs could be useful as dominant 
selectable markers for the introduction of cloned 
dka into cho-ki cells. 

The expression plasmid pSVLCS.l containing a 
GS ainigene utilising its cwn KNA processing signals 
and under the control of an SV4 0 late promoter, can 
unexpectedly be used to introduce a high number of 
copies of the vector into each transfected cell. 

Both GS cjenes under the control of SV4 0 
promoters were capable of further amplification when 
transfected cell lines were selected in higher 
concentrations of .Msx. Cell lines expressing 
pSV2.CS yielded variant clones resistant to very 
high levels of Msx (up to t- times higher than 
originally used to select transf ectants) with an 
increase in copy number to only 5-10 per cell. 
There was little detectable concomitant 
amplification of endogenous genes. 

pSVtGS.l is a much more suitable amplifiable 
vector since the increase in copy number was roughly 
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proportional to the concentration of Msx and very 
high copy numbers were achieved (approximately t 
10 f 000 copies per cell in cells resistant to 2 mM- 
Msx)* In this case t no detectable endogenous gene 
aaplif ication occurred. 

The pSVLGS-l ampiifiable vector has been used 
to introduce a tPA gene into CHO-Kl cells and it has 
been shown that gene amplif ication leads to higher . 
levels of t?A expression* Variant clones resistant 
to ten times the concentration oi Msx of the 
original tran?f ectants secrete about ten times the 
amount of tPA, but a further SO fold increase in 
Msx-resistance led to less than a 2 fold increase in 
tPA secretion. This suggests tiiat some aspect of 
the synthesis or secretion of tPA is close to 
saturation in these highly Msx-resistant cells. The 
maximum level of tPA secretion of 4000 U/10 6 
cells/day in the 16-2.10 mMR cell line is comparable 
with the levels of expression previously observed in 
dhfr* CHO cells using DHFR-mediated gene 
ainplif ication, the highest reported level of 
secreticn being 6000 U/10$ cells/day. This also 
supports the conclusion that tPA secretion is close 
to the maximum attainable by current methods in 
these cells. 

It will be appreciated that the present 
xnver.tion is described above purely by way of 
illustration and that modifications apd variations 
thereof may be made by the peison skilled in the art 
without departing from the spirit and scope thereof 
as defined in the appended claims- 
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C LA I MS 

I. A recombinant DNA sequence which encodes the 

complete amino acid octanes of a glutamine 
synthetase (GS). 

2 The recombinant DNA sequence of claim 1, 

which encodes the complete amino acid sequence of an 

eukaryotic GS. f 

* The recomhinant DNA sequence Of claim 1 , 

which encodes the complete amino acid sequence of a 

mammalian GS. ■ ■ 

4 The recombinant DNA sequence of claim 3, 

which encodes the complete amino acid sequence of a 

rodent GS. . 

5. The recombinant DNA sequence of claim 4, 

which encodes the complete amino acid sequence of a 

hamster GS. ■ 
6 The recombinant DNA sequence of claim. 5, 

which comprises the amine acid coding portion of the 
sequence shown in Figure 2. 

7. The recombinant dna sequence shown in Figure 

\\ A recombinant DNA sequence from one species 

. which hybridises under high stringency conditions 
with the recombinant DNA sequence of any one of 
Claims 1 to C or a part thereof from a different 
species. 

9, The recombinant DNA sequence of any one of 

claims 1 to 8, which is cDNA . 

10 The recombinant DNA sequence of claim 9 

vhcrein the cnNA is derived by reverse transcription 

11. The recombinant DNA sequence of any one of 
claims 1 to 10, which comprises a fragment of 
genomic DNA* 

12. use of the recombinant DNA sequence of any 
one of claims 1 to t> or any fragment thereof as a 
hybridisation probe. 
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13 The recombinant DNA sequer.ee of any oae of 
claims 1 to U for use in medical or diagnostic 
methods such as for detecting disease state. « 
which the level of GS in a subject is altered. 

14 A recombinant M»A vector comprising the 
hmUmw. DSA sequence of any one of claims 1 to 

15 The vector Of claim 14, which is an 
expression vector capable, in » transferor* host 
cell, of ^pressing the recombinant WK sequence of 
any one of claims 1 to 11. 

16 The vector of claim 14, further composing a 
recombinant DNA sequence which encode* the complete 
«„ino acid sequence of a desired protein other than 

U The vector of claim 15, further comprising a 

recombinant OKA sequence which encodes the complete 
aaino acid sequence of a desired protein other than 
GS, the vector also being capable, in a transferor 
host cell, of expressing the recombinant DMA 
sequence for the desired protein. 

18 The vector of claim 15 or claim 17, wherein 

the GS-cncodir. s recombinant DNA sequence is under 
the control of a regulatable promoter. 

19. The vector of claim 18, wherein the 

« , l... shock promoter or a 

regulatable promoter is a nee- SDOC * 

thetallothionein promoter. 

20. Plasmid f^VLSS.l. 

21. Plasmid pSV2.GS. 

22. Plasmid pZIPGS. 

23. Plasmid pSVLCS.tPAl6 
24.. Plasmid pSVLGS.t?A17 

2S. A host cell transformed with a vector 
according to any one of claims 14 to 24. 
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2$. A method for co-amplifying a recombinant DNA 
sequence which encodes the complete amino acid 
sequence of a desired protein other than GS, which 
comprises co-transf orming a host cell with a vector 
according to claim 15, claim 18 or claim 19 when 
dependent on claim 15 , or any one of claims 20 to 
22, and a vector comprising said desired protein 
recombinant DNA sequence, 

27. A Jnethod for co-amplifying a recombinant DNA 
sequence which encodes the complete amino acid 
sequence of a desired protein other than GS which 
comprises transforming a host eel) with a vector 
according to claim 17, claim 18 or claim 19 when 
dependent on claim 17, claim 2 3 or claim 24. 

28 ♦ The method of claim 26 or Claim 27, wherein 
the desired protein is tissue plasminogen activator. 

29. The method of any cne of claims 26 to 28, 
wherein amplification is achieved by selection for 
resistance to progressively increased levels of a GS 

* inhibitor. 

30. The method of claim 29, wherein the gs 
inhibitor is phosphinothricir. or methionine 
su lphoximine . 

31. The method of claim 29 or claim 30, wherein 
after amplification, the level of GS accumulation 

is reduced by adding glutamine to the culture medium. 

32. The method of any one of claims 29 to 31, 
wherein the amount of GS inhibitor required 
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to cause amplification is reduced by the addition of 
methionine to the culture medium. 

33. The method of any one of claims 26 to 30 when 
dependent on claim 18 or claim 19, wherein the 
GS-encoding recombinant DKA sequence expression is 
switched on during selection and amplification and 
subsequently down-regulated. 

34. L'se of a vector accordins to any one of 
claims 15 and 17 to 24 as a dominant selectable 
marker by transforming a host cell which contains an 
active GS gene with the vector, tliereby conferring 
transformant cells with resistance to GS inhibitors. 

35. Use of a vector according to any one of 
claims 15 and 17 to 24 in endowing a cell line with 
the ability to survive in a medium lacking glutamine 
by transforming a host cell either completely 
lacking or reduced in GS activity with the victor. 

?6- The method of any one of claims 26 to 34, 
wherein vh« host cell is a mammalian cell. 

37. The method of any one of claims 26 to 34, 
wherein the host cell is a ckq-ki cell. 



38. The. method of claim 35, wherein the host cell 
is a myeloma cell. 
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